201 research outputs found

    Genome Organization and Gene Expression Shape the Transposable Element Distribution in the Drosophila melanogaster Euchromatin

    Get PDF
    The distribution of transposable elements (TEs) in a genome reflects a balance between insertion rate and selection against new insertions. Understanding the distribution of TEs therefore provides insights into the forces shaping the organization of genomes. Past research has shown that TEs tend to accumulate in genomic regions with low gene density and low recombination rate. However, little is known about the factors modulating insertion rates across the genome and their evolutionary significance. One candidate factor is gene expression, which has been suggested to increase local insertion rate by rendering DNA more accessible. We test this hypothesis by comparing the TE density around germline- and soma-expressed genes in the euchromatin of Drosophila melanogaster. Because only insertions that occur in the germline are transmitted to the next generation, we predicted a higher density of TEs around germline-expressed genes than soma-expressed genes. We show that the rate of TE insertions is greater near germline- than soma-expressed genes. However, this effect is partly offset by stronger selection for genome compactness (against excess noncoding DNA) on germline-expressed genes. We also demonstrate that the local genome organization in clusters of coexpressed genes plays a fundamental role in the genomic distribution of TEs. Our analysis shows that—in addition to recombination rate—the distribution of TEs is shaped by the interaction of gene expression and genome organization. The important role of selection for compactness sheds a new light on the role of TEs in genome evolution. Instead of making genomes grow passively, TEs are controlled by the forces shaping genome compactness, most likely linked to the efficiency of gene expression or its complexity and possibly their interaction with mechanisms of TE silencing

    Discovery of 42 genome-wide significant loci associated with dyslexia

    Full text link
    Reading and writing are crucial life skills but roughly one in ten children are affected by dyslexia, which can persist into adulthood. Family studies of dyslexia suggest heritability up to 70%, yet few convincing genetic markers have been found. Here we performed a genome-wide association study of 51,800 adults self-reporting a dyslexia diagnosis and 1,087,070 controls and identified 42 independent genome-wide significant loci: 15 in genes linked to cognitive ability/educational attainment, and 27 new and potentially more specific to dyslexia. We validated 23 loci (13 new) in independent cohorts of Chinese and European ancestry. Genetic etiology of dyslexia was similar between sexes, and genetic covariance with many traits was found, including ambidexterity, but not neuroanatomical measures of language-related circuitry. Dyslexia polygenic scores explained up to 6% of variance in reading traits, and might in future contribute to earlier identification and remediation of dyslexia

    Efficiency and Power as a Function of Sequence Coverage, SNP Array Density, and Imputation

    Get PDF
    High coverage whole genome sequencing provides near complete information about genetic variation. However, other technologies can be more efficient in some settings by (a) reducing redundant coverage within samples and (b) exploiting patterns of genetic variation across samples. To characterize as many samples as possible, many genetic studies therefore employ lower coverage sequencing or SNP array genotyping coupled to statistical imputation. To compare these approaches individually and in conjunction, we developed a statistical framework to estimate genotypes jointly from sequence reads, array intensities, and imputation. In European samples, we find similar sensitivity (89%) and specificity (99.6%) from imputation with either 1× sequencing or 1 M SNP arrays. Sensitivity is increased, particularly for low-frequency polymorphisms (MAF <5%), when low coverage sequence reads are added to dense genome-wide SNP arrays — the converse, however, is not true. At sites where sequence reads and array intensities produce different sample genotypes, joint analysis reduces genotype errors and identifies novel error modes. Our joint framework informs the use of next-generation sequencing in genome wide association studies and supports development of improved methods for genotype calling

    Dominance and the evolutionary accumulation of cis-and trans-effects on gene expression

    Get PDF
    Gene expression levels appear to be under pervasive stabilizing selection. Yet the genetic architecture underlying abundant gene expression diversity within and between populations remains elusive. Here, we investigated the role of dominance in the segregation of cis-and trans-regulation within and between populations. We used chromosome substitution lines of Drosophila melanogaster to show that (i) &gt;70% of the genes that are differentially expressed between two homozygous lines are masked in the heterozygous, suggesting that one of the substituted chromosomes contains a recessive allele; (ii) such large masking is already obtained with heterozygous chromosomes originating from the same population, with the time of divergence between chromosomes in heterozygous lines making only a small but significant contribution to the masking of variation observed in homozygous lines; (iii) variation in gene expression due to trans-regulation is biased toward greater deviations from additivity because of recessive and dominant alleles, whereas variation due to cisregulation shows higher additivity; and (iv) genetic divergence between second chromosomes is associated with increased cisregulation, whereas the level of trans-regulation shows little increase over the time scale studied. Our results indicate that cis-acting alleles may be preferentially fixed by positive natural selection because of their higher additivity, and that the disruption of gene expression by recessive variation with pervasive transeffects may be important for understanding gene expression variation within populations. We suggest that widespread regulatory effects of recessive low-frequency homozygous variation may provide a general mechanism mediating disease phenotypes and the genetic load of natural populations. genetic load ͉ genome architecture ͉ regulatory evolution ͉ recessive ͉ natural selectio

    Genome-wide association study of problematic opioid prescription use in 132,113 23andMe research participants of European ancestry

    Get PDF
    The growing prevalence of opioid use disorder (OUD) constitutes an urgent health crisis. Ample evidence indicates that risk for OUD is heritable. As a surrogate (or proxy) for OUD, we explored the genetic basis of using prescription opioids \u27not as prescribed\u27. We hypothesized that misuse of opiates might be a heritable risk factor for OUD. To test this hypothesis, we performed a genome-wide association study (GWAS) of problematic opioid use (POU) in 23andMe research participants of European ancestry (N = 132,113; 21% cases). We identified two genome-wide significant loci (rs3791033, an intronic variant of KDM4A; rs640561, an intergenic variant near LRRIQ3). POU showed positive genetic correlations with the two largest available GWAS of OUD and opioid dependence (

    Genome-wide association study meta-analysis of the Alcohol Use Disorders Identification Test (AUDIT) in two population-based cohorts (N=141,932)

    Get PDF
    OBJECTIVE: Alcohol use disorders are common conditions that have enormous social and economic consequences. Genome-wide association analyses were performed to identify genetic variants associated with a proxy measure of alcohol consumption and alcohol misuse and to explore the shared genetic basis between these measures and other substance use, psychiatric, and behavioral traits. METHOD: This study used quantitative measures from the Alcohol Use Disorders Identification Test (AUDIT) from two population-based cohorts of European ancestry (UK Biobank [N=121,604] and 23andMe [N=20,328]) and performed a genome-wide association study (GWAS) meta-analysis. Two additional GWAS analyses were performed, a GWAS for AUDIT scores on items 1-3, which focus on consumption (AUDIT-C), and for scores on items 4-10, which focus on the problematic consequences of drinking (AUDIT-P). RESULTS: The GWAS meta-analysis of AUDIT total score identified 10 associated risk loci. Novel associations localized to genes including JCAD and SLC39A13; this study also replicated previously identified signals in the genes ADH1B, ADH1C, KLB, and GCKR. The dimensions of AUDIT showed positive genetic correlations with alcohol consumption (rg=0.76-0.92) and DSM-IV alcohol dependence (rg=0.33-0.63). AUDIT-P and AUDIT-C scores showed significantly different patterns of association across a number of traits, including psychiatric disorders. AUDIT-P score was significantly positively genetically correlated with schizophrenia (rg=0.22), major depressive disorder (rg=0.26), and attention deficit hyperactivity disorder (rg=0.23), whereas AUDIT-C score was significantly negatively genetically correlated with major depressive disorder (rg=-0.24) and ADHD (rg=-0.10). This study also used the AUDIT data in the UK Biobank to identify thresholds for dichotomizing AUDIT total score that optimize genetic correlations with DSM-IV alcohol dependence. Coding individuals with AUDIT total scores ≤4 as control subjects and those with scores ≥12 as case subjects produced a significant high genetic correlation with DSM-IV alcohol dependence (rg=0.82) while retaining most subjects. CONCLUSIONS: AUDIT scores ascertained in population-based cohorts can be used to explore the genetic basis of both alcohol consumption and alcohol use disorders
    corecore